PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Aradu.YF20P
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Dalbergieae; Arachis
Family HD-ZIP
Protein Properties Length: 786aa    MW: 88276.2 Da    PI: 7.1059
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Aradu.YF20PgenomeNCGR_PGCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox629e-2088143156
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  +++ +++t+ q++e+e+lF+++++p+ ++r +L+++lgL+ rqVk+WFqNrR+++k
  Aradu.YF20P  88 KKRYHRHTARQIHEMEALFKECQHPDDKQRLKLSHELGLKPRQVKFWFQNRRTQMK 143
                  688899***********************************************998 PP

2START149.33.4e-472895132206
                  HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS..............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
        START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.............dsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                  la + ++elvk+  a+ep+W +     +gd  + +fee +               ++ea+r+++vv+m++++lv  +ld + +W e ++    +a+t+
  Aradu.YF20P 289 LAASSMAELVKMCHANEPLWIRGC---DGDREVLNFEEHGRlfpwplnlkhrtdLRTEASRDTAVVIMNSVTLVDAFLDAN-KWMELFPtivsRAKTV 382
                  577899****************99...5555555665544469**************************************.**999999999***** PP

                  EEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE- CS
        START  83 evissg......galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdl 172
                  + iss+      g lqlm+ae+q+lsplv+ R+  f+Ry++   ++g+w ivd  +ds q++  ++s+ R  ++pSg+li++++ng+s+vtw+eh+++
  Aradu.YF20P 383 QIISSNasghasGSLQLMYAEFQVLSPLVStRETHFLRYCQHnAEEGTWGIVDFPIDSFQQNL-HPSYPRYCRRPSGCLIQDMPNGYSRVTWIEHAEV 479
                  ******************************************9******************98.9********************************* PP

                  -SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
        START 173 kgrlphwllrslvksglaegaktwvatlqrqcek 206
                  +++ +h+++   v sg+a+ga++w+  lqrqce+
  Aradu.YF20P 480 EEKPVHQIFSNYVYSGMAFGAQRWLGVLQRQCER 513
                  ********************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.34E-1966146IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.1E-2081151IPR009057Homeodomain-like
PROSITE profilePS5007116.89285145IPR001356Homeobox domain
SMARTSM003891.6E-1886149IPR001356Homeobox domain
CDDcd000862.59E-1788146No hitNo description
PfamPF000462.7E-1788143IPR001356Homeobox domain
PROSITE patternPS000270120143IPR017970Homeobox, conserved site
PROSITE profilePS5084843.226279516IPR002913START domain
SuperFamilySSF559612.88E-32280515No hitNo description
CDDcd088751.79E-116283512No hitNo description
SMARTSM002349.2E-30288513IPR002913START domain
PfamPF018523.8E-39289513IPR002913START domain
SuperFamilySSF559612.33E-18531764No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0048497Biological Processmaintenance of floral organ identity
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 786 aa     Download sequence    Send to blast
WFASTEETFS PQHKKKERRS SKNVWGLPSN VYNGRQRSCK LIRNTFLFSD PKQQLHLHAY  60
HAFSSLFSHH VCAVLEEETD QQQPQNKKKR YHRHTARQIH EMEALFKECQ HPDDKQRLKL  120
SHELGLKPRQ VKFWFQNRRT QMKAQQDRAE NVILRAENET LKNENYRLQA SLRSVICPNC  180
GGPGIMGADL GLDEQQLRIE NARLREELER VCCLTSRYSG RPIQTMAPAP ALMAPSLDLD  240
MNIYPRHYVD PTIPPCNEMI PVPMLPPEAS QFPEGGLLME EEKSLAMDLA ASSMAELVKM  300
CHANEPLWIR GCDGDREVLN FEEHGRLFPW PLNLKHRTDL RTEASRDTAV VIMNSVTLVD  360
AFLDANKWME LFPTIVSRAK TVQIISSNAS GHASGSLQLM YAEFQVLSPL VSTRETHFLR  420
YCQHNAEEGT WGIVDFPIDS FQQNLHPSYP RYCRRPSGCL IQDMPNGYSR VTWIEHAEVE  480
EKPVHQIFSN YVYSGMAFGA QRWLGVLQRQ CERVASLMAR NISDLGVIHS PEARKNLMKL  540
AQRMIRTFSL NMSTSSGQSW TAISDSPEDT VRITTRKITE PGQPNGVILG AVSTTWLPYS  600
HTKVFDLLRD ERHRSQMDAL SNGNSLNEVA HIANGSHPGN CISLLRINVA SNSSQNVELM  660
LQESCTDQSG SVVVYTTIAV DSIQVAMSGE DPSCIALLPQ GFIITPMPPS SNNSSSSEAL  720
INDHGCLLTV GLQILASTIP SAKLNLSSVT SITNHLCSTV HQIEASLCNT ANNEPTTTTA  780
PPKTVN
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015963259.10.0PREDICTED: LOW QUALITY PROTEIN: homeobox-leucine zipper protein ROC3-like
SwissprotA2ZAI70.0ROC3_ORYSI; Homeobox-leucine zipper protein ROC3
SwissprotQ9FJS20.0HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBLA0A0S3T3420.0A0A0S3T342_PHAAN; Uncharacterized protein
STRINGGLYMA09G34061.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF109463036
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G46880.10.0homeobox-7